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Mycobacterium bovis strain AN5 has been used to produce purified protein derivative (PPD) for the intradermal test for bovine 
tuberculosis since it was introduced in 1948. This work reports the draft genome sequence of M. bovis AN5, which is used for the 
production of bovine PPD in Brazil, as well as comparisons to other strains of M. bovis and Mycobacterium tuberculosis. 
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ycohacterium bovis is the causative agent of bovine tubercu- 
losis, a disease that accounts for annual losses of $3 billion 
(1) and that poses a threat to public health and animal welfare (2, 
3). In Brazil and many other countries, the program for control- 
ling bovine tuberculosis involves testing cattle with an antigen 
preparation of the M. bovis AN5 strain, namely, purified protein 
derivative (PPD), followed by slaughtering positive reactors. This 
strain produces a high yield of cell mass on glycerinated medium, 
a desirable phenotype that was selected by repeated subculture of 
the bacillus on laboratory medium (4). The draft genome of the 
M. bovis AN5 strain used in BrazQ for PPD production is reported 
in this paper. Comparisons of the genes coding for PPD proteins 
m other strains of M. bovis (AF2122/97 and 04-303) andMycobac- 
terium tuberculosis H37Rv are also described. 

The genome sequence was obtained using MiSeq technology 
(5), producing a total of 3,240,633 paired-end reads. After filter- 
ing, 2,269,762 paired-end reads and 372,399 single-end reads were 
used in the assembly. We performed a reference-assisted genome 
assembly of the filtered data for M. bovis AN5 with the M. bovis 
AF2 122/97 strain (GenBank accession no. NC_002945) using 
Bowtie (6). The assembly contains 70 contigs (with sizes no 
shorter than 500 bp) and an N^q of 150,219. Annotation for the 
M. bovis AN5 genome was obtained by using the NCBI Prokary- 
otic Genome Annotation Pipeline (7) and comprises 3,938 coding 
sequences (CDSs) and 48 RNA genes (3 rRNAs and 45 tRNAs). 
Out of the 3,938 CDSs, 2,830 (72%) have a functional assignment. 

A comparison of the genes coding for proteins found in bovine 
PPDs produced in the United Kingdom (PPDUK) and in Brazil 
(PPDBR) by liquid chromatography- tandem mass spectrometry 
revealed a total of 116 proteins. Of these, 67 were found only in 
PPDUK, 12 were found only in PPDBR, and 37 were identified in 
both samples (8). This fact suggests that the Brazilian M. bovis 
AN5 isolate might be missing some PPD genes or that the genes 
contain a high degree of variation. Genes coding for all 1 16 pro- 
teins found in both PPDs were compared to their orthologs in 



M. foov!sAF2 122/97 (1), M. bovis 04-303 (9), andM. tuberculosis 
H37Rv (10) and showed a high degree of conservation, with iden- 
tities ranging from 98% to 100%. In addition, no frameshifts were 
detected. Genes coding for the immunodominant proteins 
MPB64, MPB70, MPB83, GroEL, GroES, HspX, CFP-10, and 
CFP-21 are 100% conserved among the orthologs. The gene cod- 
ing for ESAT-6 showed 100% identity with orthologs in M. tuber- 
culosis H37Rv and M. bovis AF2122/97 strains, and 99% identity 
with the ortholog in M. bovis 04-303. The gene coding for MPB53 
showed 100% identity with orthologs in M. bovis 04-303 and 
M. bovis AF2 122/97 and 99% identity with that in M. tuberculosis 
H37Rv. In conclusion, neither gene deletion nor gene variation in 
M. bovis AN5 strains used for PPDBR supports the proteomic 
differences compared with PPDUK. 

Nucleotide sequence accession numbers. This whole-genome 
shotgun project has been deposited at DDBJ/EMBL/GenBank un- 
der accession no. AWPLOOOOOOOO. The version described in this 
paper is version AWPLOIOOOOOO. 
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